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(57) Abstract 

A communication architecture for performing 
work in a distributed computing environment in- 
volves client processes on client nodes sending work 
requests to a director. The director examines the 
work requests to determine one or more resources 
required to perform the work as well as one or 
more server nodes that can provide the required re- 
sources. The director then forwards the work re- 
quests to server processes on the server nodes that 
can provide the required resources. The work re- 
quests provided to the server processes specify that 
the work results are to be provided directly from the 
server processes to the client processes that made 
the work requests. The director maintains a table 
of resource data that specifies the services and/or re- 
sources that are provided by and/or made available 
on the server nodes in the distributed computing en- 
vironment. 
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COMMUNICATION ARCHITECTURE FOR DISTRIBUTED COMPUTING ENVIRONMENT 

FIELD OF THE INVENTION 
5 The present invention relates to distributed computing, and more specifically, 

to a communication architecture for a distributed computing environment. 

BACKGROUND OF THE INVENTION 

In a distributed computing environment, work can be performed on multiple 
computing entities, often referred to as "nodes." In the context of distributed 

10 computing, the term "work" refers to any processing of data. One example of a node 
is a computer system on which one or more processes or threads may execute. Nodes 
in a distributed computing environment are typically communicatively coupled, for 
example via a network, to allow processes on nodes to access resources on other 
nodes. In a hardware context, a resource is generally a device such as a printer, disk 

1 5 drive or memory. In a software context, a resource is generally an item of data or a 
routine. 

An important concern in distributed computing is how to provide 
communication between processes on different nodes to ensure that work is 
completed in an orderly and efficient manner. Two such prior approaches include the 
20 direct communication architecture and the managed communication architecture. 

DIRECT COMMUNICATION ARCHITECTURE 
According to the direct communication architecture, processes communicate 
directly with processes on other nodes to request that work be performed and to obtain 
25 work results. FIG. 1 illustrates a distributed computing system 100 that uses a direct 
communication architecture approach. Distributed computing system 100 includes 
nodes N1-N5. Nodes Nl and N2 are each communicatively coupled to nodes N3-N5 
via connections that provide for the exchange of data between nodes Nl and N2 and 
nodes N3-N5. Specifically, node Nl is connected to node N3 via connection 102, to 
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node N4 via connection 104 and to node N5 via connection 106. Node N2 is 
connected to node N3 via connection 108, to node N4 via connection 1 10 and to node 
N5 via connection 112. 

A node having processes that require work to be performed is referred to 
5 herein as a "client" node. Processes on client nodes that require work to be performed 
are client processes. A node having processes that provide a service to and/or 
perform work on behalf of processes on other nodes is referred to herein as a "server" 
node. Processes on server nodes that provide a service and/or perform work on 
behalf of client processes are server processes. Sometimes nodes have both client 
1 0 processes and server processes executing thereon and therefore the "client node" and 
"server node" terminology is used herein to aid in understanding prior approaches and 
embodiments of the invention. 

In distributed computing system 100, nodes Nl and N2 are client nodes while 
nodes N3-N5 are server nodes. When a particular client process on client node N 1 
1 5 requires work to be performed on server node N3, the particular client process sends a 
request directly to a server process on node N3 over connection 1 02. The request may 
be in any form, for example a remote procedure call (RPC). The process on server 
node N3 performs the requested work and provides the results directly to the 
particular client process. Work is similarly performed on server nodes N3-N5 on 
20 behalf of client processes on client node N2. 

According to the direct communication architecture approach, client processes 
communicate directly with server processes that can perform work for the client 
processes. A disadvantage of this approach is that client processes must know which 
server process can perform the required work. When the services offered by any of 
25 the server processes change, the client nodes must be made aware of the change. 

Similarly, when a new service is provided, the client processes must be made aware of 
the new server node. The services offered by a server process can change when either 
the operations performed by the server process change or when the resources that a 
server process operates on change. For distributed computing systems with a large 
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number of client processes and server nodes, a significant amount of system resources 
can be required to keep all of the client processes updated. 

MANAGED COMMUNICATION ARCHITECTURE 
5 According to the managed communication architecture approach, client 

processes send work requests to a management process referred to herein as a 
"director." The director forwards the work requests to the appropriate server process 
that can perform the required work. After performing the work, the server processes 
send the work results to the director, which forwards the work results to the client 

1 0 process that made the original work request. 

FIG. 2 illustrates a distributed computing system 200 that uses a managed 
communication architecture approach. Distributed computng system 200 includes 
client nodes Nl and N2 and server nodes N3-N5. Client nodes NI and N2 are each 
communicatively coupled to a director 202 via connections 204 and 206, respectively. 

1 5 Client processes on client nodes N 1 and N2 submit work requests to director 202 via 
connections 204 and 206. Director 202 determines which of the server nodes N3-N5 
can perform the requested work and forwards the work requests to the appropriate 
server process on server node N3-N5 on connections 208, 210 and 212, respectively. 
After the appropriate server process has performed the requested work, the server 

20 process provides the work results to director 202. Director 202 then forwards the 
work results to the client process that requested the work. 

According to the managed communication architecture approach, director 202 
keeps track of the services provided by server nodes N3-N5. As a result, when a 
change in services offered by server nodes N3-N5 occurs, only director 202 has to be 

25 informed and not client processes on client nodes Nl and N2. This reduces the 

amount of system resources required to keep track of changes in services offered by 
server nodes N3-N5, particularly when there are a large number of server nodes. 
However, the managed communication architecture approach requires more system 
resources than the direct communication architecture to manage the additional 
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messaging traffic associated with director 202. For example, for a single work 
request, the direct communication architecture approach requires two messages, one 
message from a client process to a server process and one message from the server 
process back to the client process. To process the same work request; the managed 

5 communication architecture requires four messages, one from the client process to the 
director, one from the director to the server process, one from the server process back 
to the director and one from the director back to the client process. 

Based on the need to perform work in a distributed computing environment 
and the limitations in the prior communication architectures, a communication 

10 architecture for a distributed computing environment that avoids the limitations in the 
approaches previously described is highly desirable. 



SUMMARY OF THE INVENTION 

According to one aspect of the invention, a method is provided for processing 

15 data on a distributed computing system that includes a plurality of nodes. The method 
comprises the steps of, in response to receiving a first work request to perform first 
work from a first process on a first node, determining based upon the first work that 
the first work is to be performed on a second node and providing the first work 
request to a second process on the second node. The first work request specifies that 

20 the first process is to receive results of the first work directly from the second process. 
According to another aspect of the invention, a method is provided for 
processing data on a distributed computing system that includes a plurality of nodes. 
A director receives a first remote procedure call from a first client process on a first 
client node. The first remote procedure call requests that first work be performed and 

25 that results of the first work be provided directly to the first client process. The 
director examines the first remote procedure call and determines that a first server 
process on a first server node is to perform the first work. Then the director provides 
the first remote procedure call to the first server process. 
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According to another aspect of the invention, a distributed computing system 
for performing work comprises a plurality of nodes and a director communicatively 
coupled to the plurality of nodes. The director is configured to, in response to a first 
work request to perform first work from a first process on a first node, determine that 
5 the first work is to be performed on a second node from the plurality of nodes. The 
director is further configured to request that the first work be performed by a second 
process on the second node and that first results of the first work be provided from the 
second process directly to the first process. 

1 0 BRIEF DESCRIPTION OF THE DRAWINGS 

Embodiments of the invention are illustrated by way of example, and not by 
way of limitation, in the figures of the accompanying drawings and in which like 
reference numerals refer to similar elements and in which: 

FIG. 1 is a block diagram illustrating a conventional distributed computing 
15 system that uses a direct communication architecture to perform work; 

FIG. 2 is a block diagram illustrating a conventional distributed computing 
system that uses a managed communication architecture to perform work; 

FIG. 3 is a block diagram illustrating a distributed computing system using a 
communication architecture to perform work according to an embodiment of the 
20 invention; 

FIG. 4 is a flow chart illustrating an approach for performing work in a 
distributed computing system according to an embodiment of the invention; 

FIG. 5 is a resource data table according to an embodiment of the invention; 

and 

25 FIG. 6 is a block diagram of a computer system on which embodiments of the 

invention may be implemented. 
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DETAILED DESCRIPTION OF THE INVENTION 

In the following description, for the purposes of explanation, specific details 
are set forth in order to provide a thorough understanding of the invention. However, 
it will be apparent that the invention may be practiced without these specific details. 
5 In other instances, well-known structures and devices are depicted in block diagram 
form in order to avoid unnecessarily obscuring the invention. 

FUNCTIONAL OVERVIEW 
In general, the communication architecture for performing work in a 
distributed computing environment, according to an embodiment of the invention, 
involves client processes on client nodes sending work requests to a director. The 
director examines the work requests to determine resources and/or services that are 
required to perform the requested work. The director then forwards the work requests 
to server processes on server nodes that can perform the requested work. The work 
requests provided to the server processes specify that work results are to be provided 
directly from the server processes to the client processes that made the work requests. 
Consequently, server processes provide their work results directly to the client 
processes that requested the work, thereby bypassing the director. This approach 
eliminates the need for client processes to be aware of the services provided by the 
server processes and nodes while requiring relatively fewer system resources than 
conventional managed communication architectures. 

DETAILED DESCRIPTION 
FIG. 3 illustrates a distributed computing system 300 that uses a 
communication architecture for performing work according to an embodiment of the 
invention. Distributed computing system 300 includes client nodes Nl and N2 and 
25 server nodes N3-N5. Client nodes N 1 and N2 are each communicatively coupled to a 
director 302 via connections 304 and 306, respectively. Client processes on client 
nodes Nl and N2 submit work requests to director 302 via connections 304 and 306. 
Director 302 determines which of the server nodes N3-N5 can perform the requested 
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work and forwards the work requests to a server process on one or more of server 
nodes N3-N5 on connections 308, 3 1 0 and 3 1 2, respectively. Director 302 then 
forwards the work request to the particular server process. The work requests specify 
where the results of performing the requested work are to be sent, i.e. to client node 
5 Nl or N2. Director 302 is described in more detail hereinafter. 

After the server processes on server nodes N3-N5 have performed the 
requested work, server nodes N3-N5 provide the work results directly to the client 
processes on client nodes Nl and N2 that made the work request, bypassing director 
302. Specifically, server processes on server node N3 provide work results to client 

10 processes on client nodes N 1 and N2 via connections 3 1 4 and 3 1 6, respectively. 

Server processes on server node N4 provide work results to client processes on client 
nodes Nl and N2 via connections 3 1 8 and 320, respectively. Server processes on 
server node N5 provide work results to client processes on client nodes Nl and N2 via 
connections 322 and 324, respectively. 

15 The approach for performing work on a distributed computing system is now 

described with reference to both the block diagram of FIG. 3 and the flow chart 400 
of FIG. 4. Referring to FIG. 4, after starting in step 402, in step 404 a requestor, e.g. a 
client process on client node N 1 , sends a work request to director 302 over connection 
304 or 306. In step 406, director 302 examines the work request and determines 

20 which of the server nodes N3-N5 can perform the requested work. According to one 
embodiment of the invention, director 302 examines the work request to identify the 
service required to perform the requested work, where a service is a particular 
operation on a particular resource. Director 302 then compares the required service to 
services supported by server nodes N3-N5 to determine which server node N3-N5 can 

25 provide the required service. 

According to another embodiment of the invention, director 302 determines 
which of the server nodes N3-N5 can provide the required service by determining the 
particular server process on server nodes N3-N5 that holds the exclusive rights (i.e. a 
lock) to a resource required by the service. 
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In step 408, director 302 sends the work request to a server process on one or 
more of server nodes N3-N5 that can perform the requested work. The work request 
sent to the server process on one or more of server nodes N3-N5 specifies that the 
work results are to be sent directly to the client process on client node N 1 or N2 that 
5 requested the work be performed. 

For example, suppose a client process on client node Nl sends a particular 
work request to perform particular work to director 302 over connection 304. 
Director 302 examines the particular work request and determines that a server 
process on server node N3 can perform the particular work. Director 302 forwards 

10 the particular work request to the server process on server node N3. The particular 
work request sent to the to the server process on server nodes N3 specifies that the 
results of the particular work are to be sent to the client process on client node Nl . 

The particular work request sent by the client process on client node Nl may 
specify that the client process on client node Nl is to receive the results of the 

1 5 particular work. According to another embodiment of the invention, the particular 
work request from client node Nl does not specify that client node N 1 is to receive 
the results of the particular work. In this situation, director 302 determines where the 
work request originated, i.e. client node Nl, and generates a new particular work 
request that specifies where the results of the particular work are to be sent, i.e. the 

20 client process on client node N 1 . 

In situations where the requested work requires resources on more than one 
server node, a work request is sent to a server process on each server node where 
work is to be performed. Each work request specifies the client node where the work 
25 results are to be sent. For example, suppose for the particular work request from 
client node Nl that director 302 determines that both server nodes N3 and N4 are 
required to perform the particular work. Director 302 provides a work request to 
server processes on both server node N3 and N4. Both work requests specify that the 
client process on client node N 1 is to receive the results of the particular work. 
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In step 410 the work is performed at the server nodes N3-N5 that received the 
work request. In step 412, the work results are provided from the server processes on 
server nodes N3-N5 that performed the work directly to the client process on client 
node N 1 or N2 that requested the work on connection 3 14, 3 1 6, 3 1 8, 320, 322 or 324. 
5 For example, if a client process on client node Nl had requested the work and director 
302 sent the work request to server process on server node N3, then the server process 
on server node N3 provides the work results to a client process on client node N 1 on 
connection 314. If the work request was sent to a server process on server nodes N4 
and N5, then the work results are provided to client node Nl on connections 3 1 8 and 
1 0 322, respectively. The process is complete in step 414. 

THE DIRECTOR 

Director 302 may be implemented on one or more client nodes Nl and N2, 
server nodes N3-N5, or one or more other nodes (not illustrated) in distributed 

1 5 computing system 300. More than one director may be used in a distributed 

computing system. For example, in a distributed computing system having a large 
number of client nodes, using several directors can improve system performance by 
reducing the number of work requests that are processed by a single director. For 
example, in a distributed computing system having thirty (30) client nodes and five 

20 (5) server nodes, three (3) directors may be used. Each director services ten (10) 
client nodes and provides work requests to all five (5) server nodes. 

As previously described, director 302 keeps track of the services provided by 
server process executing on server nodes N3-N5 so that work requests can be 
forwarded to the correct server node N3-N5. According to one embodiment of the 

25 invention, director 302 maintains the global state of server nodes N3-N5 and the 

server processes executing thereon. Then director 302 implements logic to determine 
which server process holds the exclusive rights to a resource required to process a 
work request. 
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FIG. 5 is a table 500 of resource data maintained by director 302 according to 
an embodiment of the invention. Table 500 includes entries 502, 504 and 506 for 
each server node N3-N5 in distributed computing system 300. Each entry 502, 504 
and 506 specifies the server node that the entry 502, 504 and 506 is associated with 

5 (SERVER NODE ID) and data that specifies the services and/or resources provided 
by and/or available on the associated server node (SERVER NODE SERVICE 
DATA). For example, entry 502 is associated with server node N3 and contains data 
(DATA 1) that specifies the services provided by server node N3. 

Any change in services provided by server nodes N3-N5 is reflected in table 

10 500. In addition, if a new server node (not illustrated) is added to distributed 
computing system 300, then a new entry is added to table 500 for the new server 
node. Similarly, if a server node N3-N5 is removed from distributed computing 
system 300, then the entry 502, 504, 506 associated with the removed server node N3- 
N5 is removed from table 500. 

15 

IMPLEMENTATION MECHANISMS 
The communication architecture for performing work in a distributing 
computing system as described herein may be implemented in discrete hardware 
circuitry, in computer software, or a combination of discrete hardware circuitry and 

20 computer software. The communication architecture described herein in accordance 
with the present invention is not limited to a particular type of messaging protocol. 
For example, node-to-node communication may be implemented with remote 
procedure calls (RPCs) or any other messaging protocol for requesting that work be 
performed on a remote node. 

25 FIG. 6 is a block diagram that illustrates a computer system 600 upon which an 

embodiment of the invention may be implemented. Computer system 600 includes a 
bus 602 or other communication mechanism for communicating information, and a 
processor 604 coupled with bus 602 for processing information. Computer system 600 
also includes a main memory 606, such as a random access memory (RAM) or other 
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dynamic storage device, coupled to bus 602 for storing information and instructions to 
be executed by processor 604. Main memory 606 also may be used for storing 
temporary variables or other intermediate information during execution of instructions 
to be executed by processor 604, Computer system 600 farther includes a read only 
5 memory (ROM) 608 or other static storage device coupled to bus 602 for storing static 
information and instructions for processor 604. A storage device 610, such as a 
magnetic disk or optical disk, is provided and coupled to bus 602 for storing 
information and instructions. 

Computer system 600 may be coupled via bus 602 to a display 6 1 2, such as a 

10 cathode ray tube (CRT), for displaying information to a computer user. An input device 
614, including alphanumeric and other keys, is coupled to bus 602 for communicating 
information and command selections to processor 604. Another type of user input 
device is cursor control 616, such as a mouse, a trackball, or cursor direction keys for 
communicating direction information and command selections to processor 604 and for 

15 controlling cursor movement on display 612. This input device typically has two 
degrees of freedom in two axes, a first axis (e.g., x) and a second axis (e.g., y), that 
allows the device to specify positions in a plane. 

The invention is related to the use of computer system 600 for implementing a 
communication architecture for a distributed computing system. According to one 

20 embodiment of the invention, a communication architecture for a distributed 

computing system is provided by computer system 600 in response to processor 604 
executing one or more sequences of one or more instructions contained in main 
memory 606. Such instructions may be read into main memory 606 from another 
computer-readable medium, such as storage device 610. Execution of the sequences 

25 of instructions contained in main memory 606 causes processor 604 to perform the 
process steps described herein. One or more processors in a multi -processing 
arrangement may also be employed to execute the sequences of instructions contained 
in main memory 606. In alternative embodiments, hard-wired circuitry may be used 
in place of or in combination with software instructions to implement the invention. 
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Thus, embodiments of the invention are not limited to any specific combination of 
hardware circuitry and software. 

The term "computer-readable medium" as used herein refers to any medium that 
participates in providing instructions to processor 604 for execution. Such a medium 
5 may take many forms, including but not limited to, non-volatile media, volatile media, 
and transmission media. Non-volatile media includes, for example, optical or magnetic 
disks, such as storage device 610. Volatile media includes dynamic memory, such as 
main memory 606. Transmission media includes coaxial cables, copper wire and fiber 
optics, including the wires that comprise bus 602. Transmission media can also take the 

10 form of acoustic or light waves, such as those generated during radio wave and infrared 
data communications. 

Common forms of computer-readable media include, for example, a floppy 
disk, a flexible disk, hard disk, magnetic tape, or any other magnetic medium, a CD- 
ROM, any other optical medium, punch cards, paper tape, any other physical medium 

1 5 with patterns of holes, a RAM, a PROM, and EPROM, a FLASH-EPROM, any other 
memory chip or cartridge, a carrier wave as described hereinafter, or any other 
medium from which a computer can read. 

Various forms of computer readable media may be involved in carrying one or 
more sequences of one or more instructions to processor 604 for execution. For 

20 example, the instructions may initially be carried on a magnetic disk of a remote 

computer. The remote computer can load the instructions into its dynamic memory and 
send the instructions over a telephone line using a modem. A modem local to computer 
system 600 can receive the data on the telephone line and use an infrared transmitter to 
convert the data to an infrared signal. An infrared detector coupled to bus 602 can 

25 receive the data carried in the infrared signal and place the data on bus 602. Bus 602 
carries the data to main memory 606, from which processor 604 retrieves and executes 
the instructions. The instructions received by main memory 606 may optionally be 
stored on storage device 610 either before or after execution by processor 604. 
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Computer system 600 also includes a communication interface 618 coupled to 
bus 602. Communication interface 618 provides a two-way data communication 
coupling to a network link 620 that is connected to a local network 622. For example, 
communication interface 618 may be an integrated services digital network (ISDN) 

5 card or a modem to provide a data communication connection to a corresponding type 
of telephone line. As smother example, communication interface 618 may be a local 
area network (LAN) card to provide a data communication connection to a compatible 
LAN. Wireless links may also be implemented. In any such implementation, 
communication interface 618 sends and receives electrical, electromagnetic or optical 

1 0 signals that carry digital data streams representing various types of information. 

Network link 620 typically provides data communication through one or more 
networks to other data devices. For example, network link 620 may provide a 
connection through local network 622 to a host computer 624 or to data equipment 
operated by an Internet Service Provider (ISP) 626. ISP 626 in turn provides data 

1 5 communication services through the world wide packet data communication network 
now commonly referred to as the "Internet" 628. Local network 622 and Internet 628 
both use electrical, electromagnetic or optical signals that carry digital data streams. 
The signals through the various networks and the signals on network link 620 and 
through communication interface 618, which carry the digital data to and from 

20 computer system 600, are exemplary forms of carrier waves transporting the 
information. 

Computer system 600 can send messages and receive data, including program 
code, through the network(s), network link 620 and communication interface 618. In 
the Internet example, a server 630 might transmit a requested code for an application 
25 program through Internet 628, ISP 626, local network 622 and communication 

interface 61 8. In accordance with the invention, one such downloaded application 
provides for implementation of a communication architecture for a distributed 
computing system as described herein. 
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The received code may be executed by processor 604 as it is received, and/or 
stored in storage device 61 0, or other non-volatile storage for later execution. In this 
manner, computer system 600 may obtain application code in the form of a carrier 
wave. 

5 The communication architecture described herein provides several advantages 

over prior communication architectures for distributed computing environments. 
First, the use of a director to forward work requests to server nodes eliminates the 
need for client nodes to know what services are provided by client nodes. As a result, 
client nodes do not have to be made aware of changes in the services offered by server 

10 nodes. In addition, the approach requires fewer message transfers to perform work 
and provide work results to a client process than the managed communication 
architecture approach since work results are provided by server nodes directly to 
client nodes. For example, to process a work request, embodiments of the present 
invention require three messages instead of four messages as required by managed 

15 communication architecture approaches. Specifically, the three messages include one 
from a client process to the director, one from the director to a server process and one 
from the server process back to the client process. 

In the foregoing specification, the invention has been described with reference 
to specific embodiments thereof. It will, however, be evident that various 

20 modifications and changes may be made thereto without departing from the broader, 
spirit and scope of the invention. The specification and drawings are, accordingly, to 
be regarded in an illustrative rather than a restrictive sense. 



WO 00/65441 



PCT/US00/10426 



15 

CLAIMS 

What is claimed is: 



1 1 . A method for processing data on a distributed computing system that includes 

2 a plurality of nodes, the method comprising the steps of: 

3 in response to receiving a first work request to perform first work from a first 

4 process on a first node from the plurality of nodes, determining based 

5 upon the first work that the first work is to be performed on a second 

6 node from the plurality of nodes; and 

7 providing the first work request to a second process on the second node, 

8 wherein the first work request specifies that the first process is to 

9 receive results of the first work directly from the second process. 

1 2. The method as recited in Claim 1 , further including the steps of 

2 in response to receiving a second request to perform second work from the 

3 first process, determining that the second work is to be performed on a 

4 third node from the plurality of nodes, and 

5 providing the second request to a third process on the third node, wherein the 

6 second request specifies that the first process is to receive results of the 

7 second work directly from the third process. 

1 3. The method as recited in Claim 1 , further including the steps of 

2 in response to receiving a second request to perform second work from a third 

3 process on a third node from the plurality of nodes, determining that 

4 the second work is to be performed on the second node, and 

5 providing the second request to the second process, wherein the second 

6 request specifies that the third process is to receive results of the 

7 second work directly from the second process. 



1 
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1 4. The method as recited in Claim 1 , further including the steps of 

2 in response to receiving a second request to perform second work from a third 

3 process on a third node from the plurality of nodes, determining a 

4 fourth node from the plurality of nodes on which the second work is to 

5 be performed, and 

6 providing the second request to a fourth process on the fourth node, wherein 

7 the second request specifies that the third process is to receive results 

8 of the second work directly from the fourth process. 

1 5. The method as recited in Claim 1 , further including the steps of 

2 determining that the first work is also to be performed on a third node from the 

3 plurality of nodes, and 

4 providing a second request to a third process on the third node, wherein the 

5 second request specifies that results of the first work performed on the 

6 third node are to be provided from the third node directly to the first 

7 process. 

1 6. The method as recited in Claim 1 , wherein the step of determining that the 

2 first work is to be performed on a second node includes the step of 

3 determining one or more resources required to perform the first work, and 

4 determining which of the plurality of nodes is allowed to perform the first 

5 work on the one or more resources. 

1 7. The method as recited in Claim 1 , wherein: 

2 the step of determining that the first work is to be performed on a second node 

3 from the plurality of nodes includes the step of a director determining 

4 that the first work is to be performed on a second node from the 

5 plurality of nodes, and 

6 the step of providing the first work request to a second process on the second 

7 node includes the step of the director providing the first work request 

8 to a second process on the second node. 
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1 8. The method as recited in Claim 1 , further comprising the step of upon 

2 completion of the first work, the second process providing the results of the 

3 first work directly to the first process. 

1 9. The method as recited in Claim 1 , wherein the first work request is a remote 

2 procedure call. 

1 10. The method as recited in Claim 1 , further including the step of generating an 

2 updated first work request that specifies that the first process is to receive the 

3 first results of performing the first work and wherein the step of providing the 

4 first work request to a second process includes the step of providing the 

5 updated first work request to the second process. 

1 11. A method for processing data on a distributed computing system that includes 

2 a plurality of nodes, the method comprising the steps of: 

3 a director receiving a first remote procedure call from a first client process on 

4 a first client node from the plurality of nodes, wherein the first remote 

5 procedure call requests that first work be performed and that results of 

6 the first work be provided directly to the first client process; 

7 the director examining the first remote procedure call and determining that a 

8 first server process on a first server node from the plurality of nodes is 

9 to perform the first work; and 

1 0 the director providing the first remote procedure call to the first server process. 

1 12. The method as recited in Claim 1 1 , fiirther including the steps of 

2 the director determining one or more resources required to perform the first 

3 work, and 

4 the director determining which of the plurality of nodes have permission to 

5 perform a requested operation on the one or more resources required to 

6 perform the first work. 



WO 00/65441 PCT/US00/1 0426 



18 

1 1 3. The method as recited in Claim 1 2, wherein the step of the director 

2 determining the resources available on the plurality of nodes includes the 

3 director examining resource data associated with the plurality of nodes. 

1 1 4. The method as recited in Claim 1 1 , further including the steps of 

2 the director receiving a second remote procedure call from the first client 

3 process, wherein the second remote procedure call requests that second 

4 work be performed and that results of the second work be provided 

5 directly to the first client process, 

6 the director examining the second remote procedure call and determining that 

7 a second server process on a second server node from the plurality of 

8 nodes is to perform the second work, and 

9 the director providing the second remote procedure call to the second server 
10 process. 

1 15. The method as recited in Claim 1 1 , further including the steps of 

2 the director receiving a second remote procedure call from a second client 

3 process on a second client node in the plurality of nodes, wherein the 

4 second remote procedure call requests that second work be performed 

5 and that results of the second work be provided directly to the second 

6 client process, 

7 the director examining the second remote procedure call and determining that 

8 a second server process on a second server node from the plurality of 

9 nodes is to perform the second work, and 

10 the director providing the second remote procedure call to the second server 

1 1 process. 

1 16. A distributed computing system for performing work, the distributed 

2 computing system comprising: 

3 a plurality of nodes; and 
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4 a director communicatively coupled to the plurality of nodes, the director 

5 being configured to 

6 in response to a first work request to perform first work from a first 

7 process on a first node from the plurality of nodes, determine 

8 that the first work is to be performed on a second node from the 

9 plurality of nodes, and 

10 request that the first work be performed by a second process on the 

1 1 second node, wherein the request specifies that first results of 

12 the first work be provided from the second process directly to 

13 the first process. 

1 1 7. The distributed computing system as recited in Claim 1 6, wherein the director 

2 is further configured to provide the first work request to the second process. 

1 1 8. The distributed computing system as recited in Claim 1 6, wherein the director 

2 is further configured to 

3 generate a second work request to requests that the second process perform the 

4 first work and provide the first results directly to the first process, and 

5 provide the second work request to the second process. 

1 1 9. The distributed computing system as recited in Claim 1 6, further comprising 

2 resource data that specifies the access rights of the plurality of nodes relative 

3 to resources. 

1 20. A computer-readable medium carrying one or more sequences of one or more 

2 instructions for processing data on a distributed computing system that 

3 includes a plurality of nodes, the one or more sequences of one or more 

4 instructions include instructions which, when executed by one or more 

5 processors, cause the one or more processors to perform the steps of: 

6 in response to receiving a first work request to perform first work from a first 

7 process on a first node from the plurality of nodes, determining based 
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8 upon the first work that the first work is to be performed on a second 

9 node from the plurality of nodes; and 

1 0 providing the first work request to a second process on the second node, 

1 1 wherein the first work request specifies that the first process is to 

1 2 receive results of the first work directly from the second process. 

1 21. The computer-readable medium as recited in Claim 20, ftxrther including the 

2 steps of 

3 in response to receiving a second request to perform second work from the 

4 first process, determining that the second work is to be performed on a 

5 third node from the plurality of nodes, and 

6 providing the second request to a third process on the third node, wherein the 

7 second request specifies that the first process is to receive results of the 

8 second work directly from the third process. 

1 22. The computer-readable medium as recited in Claim 20, further including the 

2 steps of 

3 in response to receiving a second request to perform second work from a third 

4 process on a third node from the plurality of nodes, determining that 

5 the second work is to be performed on the second node, and 

6 providing the second request to the second process, wherein the second 

7 request specifies that the third process is to receive results of the 

8 second work directly from the second process. 

1 23. The computer-readable medium as recited in Claim 20, further including the 

2 steps of 

3 in response to receiving a second request to perform second work from a third 

4 process on a third node from the plurality of nodes, determining a 

5 fourth node from the plurality of nodes on which the second work is to 

6 be performed, and 
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7 providing the second request to a fourth process on the fourth node, wherein 

8 the second request specifies that the third process is to receive results 

9 of the second work directly from the fourth process. 

1 24. The computer-readable medium as recited in Claim 20, further including the 

2 steps of 

3 determining that the first work is also to be performed on a third node from the 

4 plurality of nodes, and 

5 providing a second request to a third process on the third node, wherein the 

6 second request specifies that results of the first work performed on the 

7 third node are to be provided from the third node directly to the first 

8 process. 

1 25. The computer-readable medium as recited in Claim 20, wherein the step of 

2 determining that the first work is to be performed on a second node includes 

3 the step of 

4 determining one or more resources required to perform the first work, and 

5 determining which of the plurality of nodes is allowed to perform the first 

6 work on the one or more resources. 

1 26. The computer-readable medium as recited in Claim 20, wherein: 

2 the step of determining that the first work is to be performed on a second node 

3 from the plurality of nodes includes the step of a director determining 

4 that the first work is to be performed on a second node from the 

5 plurality of nodes, and 

6 the step of providing the first work request to a second process on the second 

7 node includes the step of the director providing the first work request 

8 to a second process on the second node. 
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1 27. The computer-readable medium as recited in Claim 20, further comprising the 

2 step of upon completion of the first work, the second process providing the 

3 results of the first work directly to the first process. 

1 28. The computer-readable medium as recited in Claim 20, wherein the first work 

2 request is a remote procedure call. 

1 29. The computer-readable medium as recited in Claim 20, further including the 

2 step of generating an updated first work request that specifies that the first 

3 process is to receive the first results of performing the first work and wherein 

4 the step of providing the first work request to a second process includes the 

5 step of providing the updated first work request to the second process. 
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